Picture for Bernard Ghanem

Bernard Ghanem

Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think

Add code
Apr 29, 2025
Viaarxiv icon

Towards Faster and More Compact Foundation Models for Molecular Property Prediction

Add code
Apr 28, 2025
Viaarxiv icon

Action Anticipation from SoccerNet Football Video Broadcasts

Add code
Apr 16, 2025
Viaarxiv icon

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

Add code
Apr 08, 2025
Viaarxiv icon

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

Add code
Apr 01, 2025
Viaarxiv icon

Can Video Diffusion Model Reconstruct 4D Geometry?

Add code
Mar 27, 2025
Viaarxiv icon

BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding

Add code
Mar 27, 2025
Viaarxiv icon

Structured-Noise Masked Modeling for Video, Audio and Beyond

Add code
Mar 20, 2025
Viaarxiv icon

TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos

Add code
Mar 09, 2025
Viaarxiv icon

DiffCLIP: Differential Attention Meets CLIP

Add code
Mar 09, 2025
Viaarxiv icon